Pesquisa | Portal Regional da BVS

1.

Positive Effect of Super-Resolved Structural Magnetic Resonance Imaging for Mild Cognitive Impairment Detection.

Grigas, Ovidijus; Damasevicius, Robertas; Maskeliunas, Rytis.

Brain Sci ; 14(4)2024 Apr 14.

Artigo em Inglês | MEDLINE | ID: mdl-38672031

RESUMO

This paper presents a novel approach to improving the detection of mild cognitive impairment (MCI) through the use of super-resolved structural magnetic resonance imaging (MRI) and optimized deep learning models. The study introduces enhancements to the perceptual quality of super-resolved 2D structural MRI images using advanced loss functions, modifications to the upscaler part of the generator, and experiments with various discriminators within a generative adversarial training setting. It empirically demonstrates the effectiveness of super-resolution in the MCI detection task, showcasing performance improvements across different state-of-the-art classification models. The paper also addresses the challenge of accurately capturing perceptual image quality, particularly when images contain checkerboard artifacts, and proposes a methodology that incorporates hyperparameter optimization through a Pareto optimal Markov blanket (POMB). This approach systematically explores the hyperparameter space, focusing on reducing overfitting and enhancing model generalizability. The research findings contribute to the field by demonstrating that super-resolution can significantly improve the quality of MRI images for MCI detection, highlighting the importance of choosing an adequate discriminator and the potential of super-resolution as a preprocessing step to boost classification model performance.

2.

Smartphone-Based Voice Wellness Index Application for Dysphonia Screening and Assessment: Development and Reliability.

Uloza, Virgilijus; Ulozaite-Staniene, Nora; Petrauskas, Tadas; Pribuisis, Kipras; Uloziene, Ingrida; Blazauskas, Tomas; Damasevicius, Robertas; Maskeliunas, Rytis.

J Voice ; 2023 Nov 16.

Artigo em Inglês | MEDLINE | ID: mdl-37980209

RESUMO

OBJECTIVE: This study aimed to develop a Voice Wellness Index (VWI) application combining the acoustic voice quality index (AVQI) and glottal function index (GFI) data and to evaluate its reliability in quantitative voice assessment and normal versus pathological voice differentiation. STUDY DESIGN: Cross-sectional study. METHODS: A total of 135 adult participants (86 patients with voice disorders and 49 patients with normal voices) were included in this study. Five iOS and Android smartphones with the "Voice Wellness Index" app installed were used to estimate VWI. The VWI data obtained using smartphones were compared with VWI measurements computed from voice recordings collected from a reference studio microphone. The diagnostic efficacy of VWI in differentiating between normal and disordered voices was assessed using receiver operating characteristics (ROC). RESULTS: With a Cronbach's alpha of 0.972 and an ICC of 0.972 (0.964-0.979), the VWI scores of the individual smartphones demonstrated remarkable inter-smartphone agreement and reliability. The VWI data obtained from different smartphones and a studio microphone showed nearly perfect direct linear correlations (r = 0.993-0.998). Depending on the individual smartphone device used, the cutoff scores of VWI related to differentiating between normal and pathological voice groups were calculated as 5.6-6.0 with the best balance between sensitivity (94.10-95.15%) and specificity (93.68-95.72%), The diagnostic accuracy was excellent in all cases, with an area under the curve (AUC) of 0.970-0.974. CONCLUSION: The "Voice Wellness Index" application is an accurate and reliable tool for voice quality measurement and normal versus pathological voice screening and has considerable potential to be used by healthcare professionals and patients for voice assessment.

3.

Optimized Convolutional Fusion for Multimodal Neuroimaging in Alzheimer's Disease Diagnosis: Enhancing Data Integration and Feature Extraction.

Odusami, Modupe; Maskeliunas, Rytis; Damasevicius, Robertas.

J Pers Med ; 13(10)2023 Oct 14.

Artigo em Inglês | MEDLINE | ID: mdl-37888107

RESUMO

Multimodal neuroimaging has gained traction in Alzheimer's Disease (AD) diagnosis by integrating information from multiple imaging modalities to enhance classification accuracy. However, effectively handling heterogeneous data sources and overcoming the challenges posed by multiscale transform methods remains a significant hurdle. This article proposes a novel approach to address these challenges. To harness the power of diverse neuroimaging data, we employ a strategy that leverages optimized convolution techniques. These optimizations include varying kernel sizes and the incorporation of instance normalization, both of which play crucial roles in feature extraction from magnetic resonance imaging (MRI) and positron emission tomography (PET) images. Specifically, varying kernel sizes allow us to adapt the receptive field to different image characteristics, enhancing the model's ability to capture relevant information. Furthermore, we employ transposed convolution, which increases spatial resolution of feature maps, and it is optimized with varying kernel sizes and instance normalization. This heightened resolution facilitates the alignment and integration of data from disparate MRI and PET data. The use of larger kernels and strides in transposed convolution expands the receptive field, enabling the model to capture essential cross-modal relationships. Instance normalization, applied to each modality during the fusion process, mitigates potential biases stemming from differences in intensity, contrast, or scale between modalities. This enhancement contributes to improved model performance by reducing complexity and ensuring robust fusion. The performance of the proposed fusion method is assessed on three distinct neuroimaging datasets, which include: Alzheimer's Disease Neuroimaging Initiative (ADNI), consisting of 50 participants each at various stages of AD for both MRI and PET (Cognitive Normal, AD, and Early Mild Cognitive); Open Access Series of Imaging Studies (OASIS), consisting of 50 participants each at various stages of AD for both MRI and PET (Cognitive Normal, Mild Dementia, Very Mild Dementia); and whole-brain atlas neuroimaging (AANLIB) (consisting of 50 participants each at various stages of AD for both MRI and PET (Cognitive Normal, AD). To evaluate the quality of the fused images generated via our method, we employ a comprehensive set of evaluation metrics, including Structural Similarity Index Measurement (SSIM), which assesses the structural similarity between two images; Peak Signal-to-Noise Ratio (PSNR), which measures how closely the generated image resembles the ground truth; Entropy (E), which assesses the amount of information preserved or lost during fusion; the Feature Similarity Indexing Method (FSIM), which assesses the structural and feature similarities between two images; and Edge-Based Similarity (EBS), which measures the similarity of edges between the fused and ground truth images. The obtained fused image is further evaluated using a Mobile Vision Transformer. In the classification of AD vs. Cognitive Normal, the model achieved an accuracy of 99.00%, specificity of 99.00%, and sensitivity of 98.44% on the AANLIB dataset.

4.

Improving Structural MRI Preprocessing with Hybrid Transformer GANs.

Grigas, Ovidijus; Maskeliunas, Rytis; Damasevicius, Robertas.

Life (Basel) ; 13(9)2023 Sep 11.

Artigo em Inglês | MEDLINE | ID: mdl-37763297

RESUMO

Magnetic resonance imaging (MRI) is a technique that is widely used in practice to evaluate any pathologies in the human body. One of the areas of interest is the human brain. Naturally, MR images are low-resolution and contain noise due to signal interference, the patient's body's radio-frequency emissions and smaller Tesla coil counts in the machinery. There is a need to solve this problem, as MR tomographs that have the capability of capturing high-resolution images are extremely expensive and the length of the procedure to capture such images increases by the order of magnitude. Vision transformers have lately shown state-of-the-art results in super-resolution tasks; therefore, we decided to evaluate whether we can employ them for structural MRI super-resolution tasks. A literature review showed that similar methods do not focus on perceptual image quality because upscaled images are often blurry and are subjectively of poor quality. Knowing this, we propose a methodology called HR-MRI-GAN, which is a hybrid transformer generative adversarial network capable of increasing resolution and removing noise from 2D T1w MRI slice images. Experiments show that our method quantitatively outperforms other SOTA methods in terms of perceptual image quality and is capable of subjectively generalizing to unseen data. During the experiments, we additionally identified that the visual saliency-induced index metric is not applicable to MRI perceptual quality assessment and that general-purpose denoising networks are effective when removing noise from MR images.

5.

Pareto Optimized Adaptive Learning with Transposed Convolution for Image Fusion Alzheimer's Disease Classification.

Odusami, Modupe; Maskeliunas, Rytis; Damasevicius, Robertas.

Brain Sci ; 13(7)2023 Jul 08.

Artigo em Inglês | MEDLINE | ID: mdl-37508977

RESUMO

Alzheimer's disease (AD) is a neurological condition that gradually weakens the brain and impairs cognition and memory. Multimodal imaging techniques have become increasingly important in the diagnosis of AD because they can help monitor disease progression over time by providing a more complete picture of the changes in the brain that occur over time in AD. Medical image fusion is crucial in that it combines data from various image modalities into a single, better-understood output. The present study explores the feasibility of employing Pareto optimized deep learning methodologies to integrate Magnetic Resonance Imaging (MRI) and Positron Emission Tomography (PET) images through the utilization of pre-existing models, namely the Visual Geometry Group (VGG) 11, VGG16, and VGG19 architectures. Morphological operations are carried out on MRI and PET images using Analyze 14.0 software and after which PET images are manipulated for the desired angle of alignment with MRI image using GNU Image Manipulation Program (GIMP). To enhance the network's performance, transposed convolution layer is incorporated into the previously extracted feature maps before image fusion. This process generates feature maps and fusion weights that facilitate the fusion process. This investigation concerns the assessment of the efficacy of three VGG models in capturing significant features from the MRI and PET data. The hyperparameters of the models are tuned using Pareto optimization. The models' performance is evaluated on the ADNI dataset utilizing the Structure Similarity Index Method (SSIM), Peak Signal-to-Noise Ratio (PSNR), Mean-Square Error (MSE), and Entropy (E). Experimental results show that VGG19 outperforms VGG16 and VGG11 with an average of 0.668, 0.802, and 0.664 SSIM for CN, AD, and MCI stages from ADNI (MRI modality) respectively. Likewise, an average of 0.669, 0.815, and 0.660 SSIM for CN, AD, and MCI stages from ADNI (PET modality) respectively.

6.

Pareto-Optimized Non-Negative Matrix Factorization Approach to the Cleaning of Alaryngeal Speech Signals.

Maskeliunas, Rytis; Damasevicius, Robertas; Kulikajevas, Audrius; Pribuisis, Kipras; Ulozaite-Staniene, Nora; Uloza, Virgilijus.

Cancers (Basel) ; 15(14)2023 Jul 16.

Artigo em Inglês | MEDLINE | ID: mdl-37509305

RESUMO

The problem of cleaning impaired speech is crucial for various applications such as speech recognition, telecommunication, and assistive technologies. In this paper, we propose a novel approach that combines Pareto-optimized deep learning with non-negative matrix factorization (NMF) to effectively reduce noise in impaired speech signals while preserving the quality of the desired speech. Our method begins by calculating the spectrogram of a noisy voice clip and extracting frequency statistics. A threshold is then determined based on the desired noise sensitivity, and a noise-to-signal mask is computed. This mask is smoothed to avoid abrupt transitions in noise levels, and the modified spectrogram is obtained by applying the smoothed mask to the signal spectrogram. We then employ a Pareto-optimized NMF to decompose the modified spectrogram into basis functions and corresponding weights, which are used to reconstruct the clean speech spectrogram. The final noise-reduced waveform is obtained by inverting the clean speech spectrogram. Our proposed method achieves a balance between various objectives, such as noise suppression, speech quality preservation, and computational efficiency, by leveraging Pareto optimization in the deep learning model. The experimental results demonstrate the effectiveness of our approach in cleaning alaryngeal speech signals, making it a promising solution for various real-world applications.

7.

Reliability of Universal-Platform-Based Voice Screen Application in AVQI Measurements Captured with Different Smartphones.

Uloza, Virgilijus; Ulozaite-Staniene, Nora; Petrauskas, Tadas; Pribuisis, Kipras; Blazauskas, Tomas; Damasevicius, Robertas; Maskeliunas, Rytis.

J Clin Med ; 12(12)2023 Jun 18.

Artigo em Inglês | MEDLINE | ID: mdl-37373811

RESUMO

The aim of the study was to develop a universal-platform-based (UPB) application suitable for different smartphones for estimation of the Acoustic Voice Quality Index (AVQI) and evaluate its reliability in AVQI measurements and normal and pathological voice differentiation. Our study group consisted of 135 adult individuals, including 49 with normal voices and 86 patients with pathological voices. The developed UPB "Voice Screen" application installed on five iOS and Android smartphones was used for AVQI estimation. The AVQI measures calculated from voice recordings obtained from a reference studio microphone were compared with AVQI results obtained using smartphones. The diagnostic accuracy of differentiating normal and pathological voices was evaluated by applying receiver-operating characteristics. One-way ANOVA analysis did not detect statistically significant differences between mean AVQI scores revealed using a studio microphone and different smartphones (F = 0.759; p = 0.58). Almost perfect direct linear correlations (r = 0.991-0.987) were observed between the AVQI results obtained with a studio microphone and different smartphones. An acceptable level of precision of the AVQI in discriminating between normal and pathological voices was yielded, with areas under the curve (AUC) displaying 0.834-0.862. There were no statistically significant differences between the AUCs (p > 0.05) obtained from studio and smartphones' microphones. The significant difference revealed between the AUCs was only 0.028. The UPB "Voice Screen" application represented an accurate and robust tool for voice quality measurements and normal vs. pathological voice screening purposes, demonstrating the potential to be used by patients and clinicians for voice assessment, employing both iOS and Android smartphones.

8.

A bibliometric review of grid parity, energy transition and electricity cost research for sustainable development.

Adeyemi-Kayode, Temitope M; Misra, Sanjay; Maskeliunas, Rytis; Damasevicius, Robertas.

Heliyon ; 9(5): e15532, 2023 May.

Artigo em Inglês | MEDLINE | ID: mdl-37131435

RESUMO

As the topic of sustainable development continues to prominence in global affairs, the case for renewable energy has never been stronger. To be regarded as a perfect alternative to conventional (non-renewable) energy sources in many climes, renewable energy, such as solar and wind, shows promise when considering concepts like grid parity. A significant number of studies have been devoted to understanding the concept. However, only a few studies have committed themselves to analysing the research activity carried out on it. This paper will present a bibliometric and empirical review of worldwide grid parity, energy transition, and electricity cost research. To situate the progress in this research area, a detailed search of Scopus was used to identify and situate research development in the field from 1965 until 2021. Using the data extracted from Scopus and VOSviewer for analysis, we explore different aspects of the publications, such as the volume, growth rate, and coverage of published documents, the most influential research papers and journals in this research area, and the most studied research themes in recent years. We also discuss Governmental policies in developed and developing economies that have accelerated the attainment of Grid parity in certain countries. Also, an empirical review of top-down, bottom-up, and artificial neural network approaches to evaluating grid parity was conducted. The study revealed a steady increase in the research articles focused on grid parity, energy transition, and electricity cost research from 2006. The geographic distribution of the publications shows that most of the publications on the subject originated from the USA, Germany, China, United Kingdom, and Spain, raking in 42.2% of the publications. Also, the top 7 authors with the highest document count from Scopus are from Finland, which coincidentally is one of the countries making significant progress in Grid parity attainment. Of the total document count from Scopus, only 0.02% are papers published from African Countries. Could this reluctance to publish research findings on energy transition be one of the reasons for the slow progression of sustainable energy for all in Africa? Therefore, it is imperative now more than ever for more research focusing on the attainment of grid parity, energy transition, and electricity costs for developing countries to be brought to the fore. This article provides a review of state-of-the-art research on the attainment of grid parity and energy transition with a focus on the Levelized Cost of Electricity (LCOE) models of renewable energy sources.

9.

ETISTP: An Enhanced Model for Brain Tumor Identification and Survival Time Prediction.

Hussain, Shah; Haider, Shahab; Maqsood, Sarmad; Damasevicius, Robertas; Maskeliunas, Rytis; Khan, Muzammil.

Diagnostics (Basel) ; 13(8)2023 Apr 18.

Artigo em Inglês | MEDLINE | ID: mdl-37189556

RESUMO

Technology-assisted diagnosis is increasingly important in healthcare systems. Brain tumors are a leading cause of death worldwide, and treatment plans rely heavily on accurate survival predictions. Gliomas, a type of brain tumor, have particularly high mortality rates and can be further classified as low- or high-grade, making survival prediction challenging. Existing literature provides several survival prediction models that use different parameters, such as patient age, gross total resection status, tumor size, or tumor grade. However, accuracy is often lacking in these models. The use of tumor volume instead of size may improve the accuracy of survival prediction. In response to this need, we propose a novel model, the enhanced brain tumor identification and survival time prediction (ETISTP), which computes tumor volume, classifies it into low- or high-grade glioma, and predicts survival time with greater accuracy. The ETISTP model integrates four parameters: patient age, survival days, gross total resection (GTR) status, and tumor volume. Notably, ETISTP is the first model to employ tumor volume for prediction. Furthermore, our model minimizes the computation time by allowing for parallel execution of tumor volume computation and classification. The simulation results demonstrate that ETISTP outperforms prominent survival prediction models.

10.

Accuracy Analysis of the Multiparametric Acoustic Voice Indices, the VWI, AVQI, ABI, and DSI Measures, in Differentiating between Normal and Dysphonic Voices.

Uloza, Virgilijus; Pribuisis, Kipras; Ulozaite-Staniene, Nora; Petrauskas, Tadas; Damasevicius, Robertas; Maskeliunas, Rytis.

J Clin Med ; 13(1)2023 Dec 23.

Artigo em Inglês | MEDLINE | ID: mdl-38202106

RESUMO

The study aimed to investigate and compare the accuracy and robustness of the multiparametric acoustic voice indices (MAVIs), namely the Dysphonia Severity Index (DSI), Acoustic Voice Quality Index (AVQI), Acoustic Breathiness Index (ABI), and Voice Wellness Index (VWI) measures in differentiating normal and dysphonic voices. The study group consisted of 129 adult individuals including 49 with normal voices and 80 patients with pathological voices. The diagnostic accuracy of the investigated MAVI in differentiating between normal and pathological voices was assessed using receiver operating characteristics (ROC). Moderate to strong positive linear correlations were observed between different MAVIs. The ROC statistical analysis revealed that all used measurements manifested in a high level of accuracy (area under the curve (AUC) of 0.80 and greater) and an acceptable level of sensitivity and specificity in discriminating between normal and pathological voices. However, with AUC 0.99, the VWI demonstrated the highest diagnostic accuracy. The highest Youden index equaled 0.93, revealing that a VWI cut-off of 4.45 corresponds with highly acceptable sensitivity (97.50%) and specificity (95.92%). In conclusion, the VWI was found to be beneficial in describing differences in voice quality status and discriminating between normal and dysphonic voices based on clinical diagnosis, i.e., dysphonia type, implying the VWI's reliable voice screening potential.

11.

Dialogue agents for artificial intelligence-based conversational systems for cognitively disabled: a systematic review.

Huq, Syed Mahmudul; Maskeliunas, Rytis; Damasevicius, Robertas.

Disabil Rehabil Assist Technol ; : 1-20, 2022 Nov 22.

Artigo em Inglês | MEDLINE | ID: mdl-36413423

RESUMO

PURPOSE: We present a systematic literature review of dialogue agents for Artificial Intelligence (AI) and agent-based conversational systems dealing with cognitive disability of aged and impaired people including dementia and Parkinson's disease. We analyze current applications, gaps, and challenges in the existing research body, and provide guidelines and recommendations for their future development and use. MATERIALS AND METHODS: We perform this study by applying Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) criteria. We performed a systematic search using relevant databases (ACM Digital Library, Google Scholar, IEEE Xplore, PubMed, and Scopus). RESULTS: This study identified 468 articles on the use of conversational agents in healthcare. We finally selected 124 articles based on their objectives and content as directly related to our main topic. CONCLUSION: We identified the main challenges in the field and analyzed the typical examples of the application of conversational agents in the healthcare domain, the desired characteristics of conversational agents, and chatbot support for aged people and people with cognitive disabilities. Our results contribute to a discussion on conversational health agents and emphasize current knowledge gaps and challenges for future research.IMPLICATIONS FOR REHABILITATIONA systematic literature review of dialogue agents for artificial intelligence and agent-based conversational systems dealing with cognitive disability of aged and impaired people.Main challenges and desired characteristics of the conversational agents, and chatbot support for aged people and people with cognitive disability.Current knowledge gaps and challenges for remote healthcare and rehabilitation.Guidelines and recommendations for future development and use of conversational systems.

12.

Agrast-6: Abridged VGG-Based Reflected Lightweight Architecture for Binary Segmentation of Depth Images Captured by Kinect.

Ryselis, Karolis; Blazauskas, Tomas; Damasevicius, Robertas; Maskeliunas, Rytis.

Sensors (Basel) ; 22(17)2022 Aug 24.

Artigo em Inglês | MEDLINE | ID: mdl-36080813

RESUMO

Binary object segmentation is a sub-area of semantic segmentation that could be used for a variety of applications. Semantic segmentation models could be applied to solve binary segmentation problems by introducing only two classes, but the models to solve this problem are more complex than actually required. This leads to very long training times, since there are usually tens of millions of parameters to learn in this category of convolutional neural networks (CNNs). This article introduces a novel abridged VGG-16 and SegNet-inspired reflected architecture adapted for binary segmentation tasks. The architecture has 27 times fewer parameters than SegNet but yields 86% segmentation cross-intersection accuracy and 93% binary accuracy. The proposed architecture is evaluated on a large dataset of depth images collected using the Kinect device, achieving an accuracy of 99.25% in human body shape segmentation and 87% in gender recognition tasks.

Assuntos

Processamento de Imagem Assistida por Computador , Redes Neurais de Computação , Humanos , Processamento de Imagem Assistida por Computador/métodos , Semântica

13.

Multi-Modal Brain Tumor Detection Using Deep Neural Network and Multiclass SVM.

Maqsood, Sarmad; Damasevicius, Robertas; Maskeliunas, Rytis.

Medicina (Kaunas) ; 58(8)2022 Aug 12.

Artigo em Inglês | MEDLINE | ID: mdl-36013557

RESUMO

Background and Objectives: Clinical diagnosis has become very significant in today's health system. The most serious disease and the leading cause of mortality globally is brain cancer which is a key research topic in the field of medical imaging. The examination and prognosis of brain tumors can be improved by an early and precise diagnosis based on magnetic resonance imaging. For computer-aided diagnosis methods to assist radiologists in the proper detection of brain tumors, medical imagery must be detected, segmented, and classified. Manual brain tumor detection is a monotonous and error-prone procedure for radiologists; hence, it is very important to implement an automated method. As a result, the precise brain tumor detection and classification method is presented. Materials and Methods: The proposed method has five steps. In the first step, a linear contrast stretching is used to determine the edges in the source image. In the second step, a custom 17-layered deep neural network architecture is developed for the segmentation of brain tumors. In the third step, a modified MobileNetV2 architecture is used for feature extraction and is trained using transfer learning. In the fourth step, an entropy-based controlled method was used along with a multiclass support vector machine (M-SVM) for the best features selection. In the final step, M-SVM is used for brain tumor classification, which identifies the meningioma, glioma and pituitary images. Results: The proposed method was demonstrated on BraTS 2018 and Figshare datasets. Experimental study shows that the proposed brain tumor detection and classification method outperforms other methods both visually and quantitatively, obtaining an accuracy of 97.47% and 98.92%, respectively. Finally, we adopt the eXplainable Artificial Intelligence (XAI) method to explain the result. Conclusions: Our proposed approach for brain tumor detection and classification has outperformed prior methods. These findings demonstrate that the proposed approach obtained higher performance in terms of both visually and enhanced quantitative evaluation with improved accuracy.

Assuntos

Neoplasias Encefálicas , Máquina de Vetores de Suporte , Inteligência Artificial , Neoplasias Encefálicas/diagnóstico por imagem , Humanos , Processamento de Imagem Assistida por Computador/métodos , Imageamento por Ressonância Magnética/métodos , Redes Neurais de Computação

14.

Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery.

Maskeliunas, Rytis; Kulikajevas, Audrius; Damasevicius, Robertas; Pribuisis, Kipras; Ulozaite-Staniene, Nora; Uloza, Virgilijus.

Cancers (Basel) ; 14(10)2022 May 11.

Artigo em Inglês | MEDLINE | ID: mdl-35625971

RESUMO

Laryngeal carcinoma is the most common malignant tumor of the upper respiratory tract. Total laryngectomy provides complete and permanent detachment of the upper and lower airways that causes the loss of voice, leading to a patient's inability to verbally communicate in the postoperative period. This paper aims to exploit modern areas of deep learning research to objectively classify, extract and measure the substitution voicing after laryngeal oncosurgery from the audio signal. We propose using well-known convolutional neural networks (CNNs) applied for image classification for the analysis of voice audio signal. Our approach takes an input of Mel-frequency spectrogram (MFCC) as an input of deep neural network architecture. A database of digital speech recordings of 367 male subjects (279 normal speech samples and 88 pathological speech samples) was used. Our approach has shown the best true-positive rate of any of the compared state-of-the-art approaches, achieving an overall accuracy of 89.47%.

15.

Detection of Mechanical Failures in Industrial Machines Using Overlapping Acoustic Anomalies: A Systematic Literature Review.

Qurthobi, Ahmad; Maskeliunas, Rytis; Damasevicius, Robertas.

Sensors (Basel) ; 22(10)2022 May 20.

Artigo em Inglês | MEDLINE | ID: mdl-35632297

RESUMO

One of the most important strategies for preventative factory maintenance is anomaly detection without the need for dedicated sensors for each industrial unit. The implementation of sound-data-based anomaly detection is an unduly complicated process since factory-collected sound data are frequently corrupted and affected by ordinary production noises. The use of acoustic methods to detect the irregularities in systems has a long history. Unfortunately, limited reference to the implementation of the acoustic approach could be found in the failure detection of industrial machines. This paper presents a systematic review of acoustic approaches in mechanical failure detection in terms of recent implementations and structural extensions. The 52 articles are selected from IEEEXplore, Science Direct and Springer Link databases following the PRISMA methodology for performing systematic literature reviews. The study identifies the research gaps while considering the potential in responding to the challenges of the mechanical failure detection of industrial machines. The results of this study reveal that the use of acoustic emission is still dominant in the research community. In addition, based on the 52 selected articles, research that discusses failure detection in noisy conditions is still very limited and shows that it will still be a challenge in the future.

Assuntos

Acústica , Ruído

16.

Pedestrian and Animal Recognition Using Doppler Radar Signature and Deep Learning.

Buchman, Danny; Drozdov, Michail; Krilavicius, Tomas; Maskeliunas, Rytis; Damasevicius, Robertas.

Sensors (Basel) ; 22(9)2022 May 01.

Artigo em Inglês | MEDLINE | ID: mdl-35591146

RESUMO

Pedestrian occurrences in images and videos must be accurately recognized in a number of applications that may improve the quality of human life. Radar can be used to identify pedestrians. When distinct portions of an object move in front of a radar, micro-Doppler signals are produced that may be utilized to identify the object. Using a deep-learning network and time-frequency analysis, we offer a method for classifying pedestrians and animals based on their micro-Doppler radar signature features. Based on these signatures, we employed a convolutional neural network (CNN) to recognize pedestrians and animals. The proposed approach was evaluated on the MAFAT Radar Challenge dataset. Encouraging results were obtained, with an AUC (Area Under Curve) value of 0.95 on the public test set and over 0.85 on the final (private) test set. The proposed DNN architecture, in contrast to more common shallow CNN architectures, is one of the first attempts to use such an approach in the domain of radar data. The use of the synthetic radar data, which greatly improved the final result, is the other novel aspect of our work.

Assuntos

Aprendizado Profundo , Pedestres , Animais , Humanos , Redes Neurais de Computação , Radar , Ultrassonografia Doppler

17.

Computer-Aided Depth Video Stream Masking Framework for Human Body Segmentation in Depth Sensor Images.

Ryselis, Karolis; Blazauskas, Tomas; Damasevicius, Robertas; Maskeliunas, Rytis.

Sensors (Basel) ; 22(9)2022 May 06.

Artigo em Inglês | MEDLINE | ID: mdl-35591221

RESUMO

The identification of human activities from videos is important for many applications. For such a task, three-dimensional (3D) depth images or image sequences (videos) can be used, which represent the positioning information of the objects in a 3D scene obtained from depth sensors. This paper presents a framework to create foreground-background masks from depth images for human body segmentation. The framework can be used to speed up the manual depth image annotation process with no semantics known beforehand and can apply segmentation using a performant algorithm while the user only adjusts the parameters, or corrects the automatic segmentation results, or gives it hints by drawing a boundary of the desired object. The approach has been tested using two different datasets with a human in a real-world closed environment. The solution has provided promising results in terms of reducing the manual segmentation time from the perspective of the processing time as well as the human input time.

Assuntos

Algoritmos , Corpo Humano , Computadores , Humanos , Processamento de Imagem Assistida por Computador/métodos , Semântica

18.

Reply to Nicholas et al. Using a ResNet-18 Network to Detect Features of Alzheimer's Disease on Functional Magnetic Resonance Imaging: A Failed Replication. Comment on "Odusami et al. Analysis of Features of Alzheimer's Disease: Detection of Early Stage from Functional Brain Changes in Magnetic Resonance Images Using a Finetuned ResNet18 Network. Diagnostics 2021, 11, 1071".

Odusami, Modupe; Maskeliunas, Rytis; Damasevicius, Robertas; Krilavicius, Tomas.

Diagnostics (Basel) ; 12(5)2022 Apr 27.

Artigo em Inglês | MEDLINE | ID: mdl-35626250

RESUMO

We have studied the manuscript of Nicholas et al. [...].

19.

An Ensemble Learning Model for COVID-19 Detection from Blood Test Samples.

Abayomi-Alli, Olusola O; Damasevicius, Robertas; Maskeliunas, Rytis; Misra, Sanjay.

Sensors (Basel) ; 22(6)2022 Mar 13.

Artigo em Inglês | MEDLINE | ID: mdl-35336395

RESUMO

Current research endeavors in the application of artificial intelligence (AI) methods in the diagnosis of the COVID-19 disease has proven indispensable with very promising results. Despite these promising results, there are still limitations in real-time detection of COVID-19 using reverse transcription polymerase chain reaction (RT-PCR) test data, such as limited datasets, imbalance classes, a high misclassification rate of models, and the need for specialized research in identifying the best features and thus improving prediction rates. This study aims to investigate and apply the ensemble learning approach to develop prediction models for effective detection of COVID-19 using routine laboratory blood test results. Hence, an ensemble machine learning-based COVID-19 detection system is presented, aiming to aid clinicians to diagnose this virus effectively. The experiment was conducted using custom convolutional neural network (CNN) models as a first-stage classifier and 15 supervised machine learning algorithms as a second-stage classifier: K-Nearest Neighbors, Support Vector Machine (Linear and RBF), Naive Bayes, Decision Tree, Random Forest, MultiLayer Perceptron, AdaBoost, ExtraTrees, Logistic Regression, Linear and Quadratic Discriminant Analysis (LDA/QDA), Passive, Ridge, and Stochastic Gradient Descent Classifier. Our findings show that an ensemble learning model based on DNN and ExtraTrees achieved a mean accuracy of 99.28% and area under curve (AUC) of 99.4%, while AdaBoost gave a mean accuracy of 99.28% and AUC of 98.8% on the San Raffaele Hospital dataset, respectively. The comparison of the proposed COVID-19 detection approach with other state-of-the-art approaches using the same dataset shows that the proposed method outperforms several other COVID-19 diagnostics methods.

Assuntos

Inteligência Artificial , COVID-19 , Teorema de Bayes , COVID-19/diagnóstico , Testes Hematológicos , Humanos , Aprendizado de Máquina

20.

Deep Convolutional Neural Network-Based Visual Stimuli Classification Using Electroencephalography Signals of Healthy and Alzheimer's Disease Subjects.

Komolovaite, Dovile; Maskeliunas, Rytis; Damasevicius, Robertas.

Life (Basel) ; 12(3)2022 Mar 04.

Artigo em Inglês | MEDLINE | ID: mdl-35330125

RESUMO

Visual perception is an important part of human life. In the context of facial recognition, it allows us to distinguish between emotions and important facial features that distinguish one person from another. However, subjects suffering from memory loss face significant facial processing problems. If the perception of facial features is affected by memory impairment, then it is possible to classify visual stimuli using brain activity data from the visual processing regions of the brain. This study differentiates the aspects of familiarity and emotion by the inversion effect of the face and uses convolutional neural network (CNN) models (EEGNet, EEGNet SSVEP (steady-state visual evoked potentials), and DeepConvNet) to learn discriminative features from raw electroencephalography (EEG) signals. Due to the limited number of available EEG data samples, Generative Adversarial Networks (GAN) and Variational Autoencoders (VAE) are introduced to generate synthetic EEG signals. The generated data are used to pretrain the models, and the learned weights are initialized to train them on the real EEG data. We investigate minor facial characteristics in brain signals and the ability of deep CNN models to learn them. The effect of face inversion was studied, and it was observed that the N170 component has a considerable and sustained delay. As a result, emotional and familiarity stimuli were divided into two categories based on the posture of the face. The categories of upright and inverted stimuli have the smallest incidences of confusion. The model's ability to learn the face-inversion effect is demonstrated once more.

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA